نتایج جستجو برای: reward processes

تعداد نتایج: 554393  

Journal: :bulletin of the iranian mathematical society 2011
k. khorshidian a. r. soltani

Multivariate reward processes with reward functions of constant rates, defined on a semi-Markov process, first were studied by Masuda and Sumita, 1991. Reward processes with nonlinear reward functions were introduced in Soltani, 1996. In this work we study a multivariate process , , where are reward processes with nonlinear reward functions respectively. The Laplace transform of the covar...

Journal: :Neuropharmacology 2020

Journal: :Journal of Mathematical Analysis and Applications 1993

Journal: :CoRR 2012
Christos Dimitrakakis

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Consequently, the agent is intrinsically motivated to explore its environment beyond the degree necessary to solve the current task it has at han...

Journal: :Math. Oper. Res. 2008
Jia Yuan Yu Shie Mannor Nahum Shimkin

We consider a learning problem where the decision maker interacts with a standard Markov decision process, with the exception that the reward functions vary arbitrarily over time. We show that, against every possible realization of the reward process, the agent can perform as well—in hindsight—as every stationary policy. This generalizes the classical no-regret result for repeated games. Specif...

Journal: :Theoretical Computer Science 2014

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

In robust Markov decision processes (MDPs), the uncertainty in transition kernel is addressed by finding a policy that optimizes worst-case performance over an set of MDPs. While much literature has focused on discounted MDPs, average-reward MDPs remain largely unexplored. this paper, we focus where goal to find average reward set. We first take approach approximates using prove value function ...

Journal: :Philosophical Transactions of the Royal Society B: Biological Sciences 2014

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید